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Abstract 

We  develop  a  novel  autonomous  control  synthesis  strategy  called  Phase  Space  Navi¬ 
gator  for  nonlinear  control  systems,  with  which  a  controller  for  a  nonlinear  system  can  be 
automatically  synthesized  in  phase  spaces.  The  Phase  Space  Navigator  generates  con¬ 
trol  laws  by  synthesizing  flow  shapes  of  dynamical  systems  and  planning  and  navigating 
system  trajectories  in  the  phase  spaces.  More  specifically,  the  control  synthesis  strategy 
consists  of  a  global  control  path  planner,  a  local  trajectory  generator,  and  a  reference 
trajectory  follower.  The  global  path  planner  finds  optimal  paths  from  an  initial  state  to 
the  goal  state  in  the  phase  space,  consisting  of  a  sequence  of  path  segments  connected 
at  intermediate  points  where  the  control  parameter  changes.  A  brute-force,  fine-grain 
search  in  high-dimensional  phase  spaces  would  be  prohibitively  expensive.  Modeling 
and  parsing  phase  spaces  into  trajectory  flow  pipes  provide  a  way  to  efficiently  reason 
about  the  phase  space  structures  and  search  for  global  control  paths.  The  local  trajec¬ 
tory  generator  uses  the  flow  information  about  the  phase  space  trajectories  to  produce 
smoothed  trajectories.  The  trajectory  follower  tracks  the  planned  reference  trajectory, 
reactively  corrects  deviations,  and  resynthesizes  the  reference  trajectory  if  the  dynamics 
of  the  system  changes  significantly. 

We  have  demonstrated  the  strategy  with  a  program  that  automatically  synthesizes 
global  control  paths  for  stabilizing  a  steel  column  buckling  under  compression.  The 
Phase  Space  Navigator  is  particularly  suitable  for  synthesizing  high-performance  control 
systems  that  do  not  lend  themselves  to  traditional  design  and  analysis  techniques.  It 
can  also  assist  control  engineers  in  exploring  much  larger  design  spaces  than  otherwise 
possible. 
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Abstract 

We  develop  a  novel  autonomous  control  synthesis  strategy  called  Phase 
Space  Navigator  for  nonlinear  control  systems,  with  which  a  controller  for 
a  nonlinear  system  can  be  automatically  synthesized  in  phase  spaces.  The 
Phase  Space  Navigator  generates  control  laws  by  synthesizing  flow  shapes  of 
dynamical  systems  and  planning  and  navigating  system  trajectories  in  the 
phase  spaces.  More  specifically,  the  control  synthesis  strategy  consists  of 
a  global  control  path  planner,  a  local  trajectory  generator,  and  a  reference 
trajectory  follower.  The  global  path  planner  finds  optimal  paths  from  ai[ 
initial  state  to  the  goal  state  in  the  phase  space,  consisting  of  a  sequence  o 
path  segments  connected  at  intermediate  points  where  the  control  parameter 
changes.  A  brute-force,  fine-grain  search  in  high-dimensional  phase  space; 
would  be  prohibitively  expensive.  Modeling  and  parsing  phase  spaces  intc 
trajectory  flow  pipes  provide  a  way  to  efficiently  reason  about  the  phase 
space  structures  and  search  for  global  control  paths.  The  local  trajectorj 
generator  uses  the  flow  information  about  the  phase  space  trajectories  tc 
produce  smoothed  trajectories.  The  trajectory  follower  tracks  the  planned 
reference  trajectory,  reactively  corrects  deviations,  and  resynthesizes  the 
reference  trajectory  if  the  dynamics  of  the  system  changes  significantly. 

We  have  demonstrated  the  strategy  with  a  program  that  automatical¬ 
ly  synthesizes  global  control  paths  for  stabilizing  a  steel  column  buckling 
under  compression.  The  Phase  Space  Navigator  is  particularly  suitable  for 
synthesizing  high-performance  control  systems  that  do  not  lend  themselves 
to  traditional  design  and  analysis  techniques.  It  can  also  assist  control  en¬ 
gineers  in  exploring  much  larger  design  spaces  than  otherwise  possible. 
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1  Introduction 


We  develop  a  novel  autonomous  control  synthesis  strategy  called  Phase  Space  Nav¬ 
igator  for  nonlinear  control  systems,  with  which  a  controller  for  a  nonlinear  system 
can  be  automatically  synthesized  in  phase  spaces.  The  Phase  Space  Navigator  gen¬ 
erates  control  laws  by  synthesizing  flow  shapes  of  dynamical  systems  and  planning 
and  navigating  system  trajectories  in  the  phase  spaces.  It  is  particularly  suitable 
for  synthesizing  high-performance  control  systems  that  do  not  lend  themselves  to 
traditional  design  and  analysis  techniques.  It  can  also  assist  control  engineers  in 
exploring  much  larger  design  spaces  than  otherwise  possible. 

The  control  synthesis  strategy  consists  of  a  global  control  path  planner,  a  local 
trajectory  generator,  and  a  reference  trajectory  follower.  The  global  path  plan¬ 
ner  finds  optimal  paths  from  an  initial  state  to  the  goal  state  in  the  phase  space, 
consisting  of  a  sequence  of  path  segments  connected  at  intermediate  points  where 
the  control  parameter  changes.  A  brute- force,  fine-grain  search  in  high-dimensional 
phase  spaces  would  be  prohibitively  expensive.  High-level  descriptions  of  the  phase 
space  and  trajectory  flows  provide  a  way  to  efficiently  reason  about  phase  space 
structures  and  search  for  global  control  paths.  The  local  trajectory  generator  us¬ 
es  the  flow  information  about  the  phase  space  trajectories  to  produce  smoothed 
trajectories.  The  trajectory  follower  tracks  the  planned  reference  trajectory,  reac- 
tively  corrects  deviations,  and  resynthesizes  reference  trajectory  if  the  dynamics  of 
the  system  changes  significantly.  The  control  synthesis  program  employs  various 
techniques  such  as  phase  space  cut-and-paste  operation,  programmed  damping  and 
switching,  and  piecewise-linear  approach  [16]. 

The  strategy  relies  on  the  phase  space  knowledge  obtained  from  an  autonomous 
phase  space  analysis  and  modeling  program  called  MA.PS1  that  extracts  and  rep¬ 
resents  qualitative  phase  space  structures  of  nonlinear  dynamical  systems  of  any 
order  [17].  It  is  made  possible  by  the  geometric  and  dynamical  modeling  of  the 
phase  space  behaviors  in  so-called  flow  pipes.  MAPS  generates  a  high-level  de¬ 
scription  about  a  dynamical  system  describing  equilibrium  points  and  limit  cycles, 
geometries  of  stability  regions  and  trajectory  flows,  and  their  spatial  arrangements 
and  interactions  in  the  phase  space  in  a  relational  graph.  The  high-level  description 
is  sensible  to  human  beings  and  manipulate  by  other  programs. 

The  phase  space  description  can  assist  engineers  in  designing  controllers  for 
complex  systems.  We  envision  that  a  control  engineer  uses  MAPS  to  observe 
qualitative  changes  of  phase  space  structures  when  varying  control  parameters:  (1) 
the  ‘birth,  disappearance,  and  movement  of  equilibrium  points  and  limit  cycles; 
(2)  the  enlargement  and  shrinking  of  stability  regions;  and  (3)  other  phase  space 
qualitative  changes.  The  engineer  can  visualize  the  qualitative  phase  portrait  with 
graphics  rendering  tools,  interactively  edit  the  system  to  observe  corresponding 
changes,  and  cut-and-paste  together  useful  portions  of  phase  portraits  so  that  the 


*MAPS  stands  for  Modeler  and  Analyzer  for  Phase  Spaces. 


composite  has  the  desired  properties. 

In  this  paper,  we  automate  the  above  scenario  with  the  Phase  Space  Navigator 
that  autonomously  synthesizes  nonlinear  controllers  in  phase  spaces.  More  impor¬ 
tantly,  the  method  will  be  able  to  synthesize  a  nonlinear  control  system  whose 
phase  space  is  high  dimensional  and  difficult  to  visualize,  or  on  which  the  desired 
control  properties  are  impossible  to  obtain  with  traditional  design  techniques.  The 
topological  and  dynamical  modeling  of  the  phase  space  stability  regions  and  trajec¬ 
tory  flows  forms  the  basis  for  our  method. 

2  Automatic  Control  Synthesis  in  Phase  Spaces 

The  Phase  Space  Navigator  synthesizes  a  control  system  from  a  geometric  point  of 
view  in  the  phase  space.  The  control  of  a  dynamical  system  is  interpreted  as  the 
“steering”  of  a  state  trajectory,  emanating  from  some  initial  state,  to  the  desired 
state  by  a  control  signal. 

2.1  Intelligent  Navigation  in  Phase  Spaces 

The  control  objective  for  a  stabilization  problem  is  to  synthesize  a  control  path  so 
that  the  physical  system  is  brought  to  the  goal  state  along  the  prespecified  path 
and  stays  there  afterwards  under  certain  control.  We  are  particularly  interested 
in  cases  in  which  physical  plants  to  be  controlled  operate  in  large  regions  where 
nonlinearities  of  the  plants  can  not  be  ignored.  When  no  global  stabilization  con¬ 
trol  laws  can  be  found  for  the  desired  state  with  traditional  control  techniques, 
composite  control  paths  have  to  be  synthesized.  Geometrically  speaking,  this  is 
the  case  where  initial  states  of  the  systems  are  far  away  from  the  desired  state. 
Phase  Space  Navigator  takes  advantage  of  the  underlying  dynamics  of  the  phase 
space  flows  to  plan  the  trajectory  both  locally  and  globally  and  switches  control 
at  carefully  planned  time  instances  and  places  [16].  This  type  of  control  via  phase 
space  path  planning  dictates  little  control  authority  and  achieves  the  desired  con¬ 
trol  properties  otherwise  impossible  to  obtain  or  difficult  to  manually  synthesize. 
It  is  a  small,  opportunistic  dynamical  alteration  based  on  the  global  knowledge 
of  phase  space  structures.  As  soon  as  the  system  enters  the  neighborhood  of  the 
desired  state,  a  local  linear  controller  stabilizes  the  system  at  the  desired  location. 

2.2  Parsing  Phase  Spaces  into  Flow  Pipes 

Many  nonlinear  control  systems  do  not  have  closed,  analytic  form  solutions.  Com¬ 
putational  exploration  is  often  the  most  common  means  for  designing  and  analyzing 
global  nonlinear  controllers.  Since  a  phase  space  is  a  union  of  an  infinite  number 
of  trajectories,  an  exhaustive  search  for  plausible  control  paths  in  the  space  of  all 
the  trajectories  is  computationally  too  expensive  and  numerically  too  sensitive  to 


3 


(a)  (b) 

Figure  1:  Grouping  trajectories  into  flow  pipes:  (a)  a  collection  of  trajectories  with 
the  same  qualitative  feature;  (b)  a  corresponding  flow  pipe. 

uncertainties.  We  introduce  the  novel  idea  of  flow  pipes  to  model  trajectory  flows 
in  a  phase  space. 

The  trajectory  flows  are  modeled  with  flow  pipes  that  describe  the  shape  and 
direction  of  the  flows.  (The  construction  of  the  flow  pipes  is  to  be  discussed 
in  Section  4.)  Figure  1(a)  shows  a  portion  of  the  phase  space  for  the  Lienard 
equation  [16],  consisting  of  all  trajectories  ending  at  the  attractor  denoted  by  the 
symbol  +.  A  flow  pipe  groups  this  collection  of  trajectories  that  exhibit  the  same 
qualitative  feature  into  an  equivalence  class,  as  shown  in  Figure  1(b). 

The  Phase  Space  Navigator  searches  in  a  much  smaller  space  of  equivalence 
classes  of  trajectories,  the  flow  pipes.  The  expensive  fine-grain  search  in  phase 
spaces,  possibly  of  high  dimensions,  is  avoided.  Furthermore,  the  flow  pipe  is  a 
natural  representation  with  respect  to  robustness  to  the  effect  of  noises  and  uncer¬ 
tainties,  compared  with  individual  trajectories.  Often,  the  synthesized  reference 
trajectory  has  to  be  modified  to  accommodate  uncertainties  in  the  model.  The 
flow  pipe  provides  room  for  a  small  deformation  of  the  reference  trajectory  within 
the  flow  pipe  at  runtime. 

2.3  Two-Stage  Automatic  Control  Synthesis 

The  Phase  Space  Navigator  has  two  main  modules:  the  planning  module  and 
the  tracking  module.  The  planning  module  synthesizes  a  desired  trajectory,  the 
reference  trajectory,  in  the  phase  space  based  on  the  dynamics  of  the  nominal 
model.  The  tracking  module  follows  the  reference  trajectory  and  reactively  corrects 
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Figure  2:  The  Phase  Space  Navigator 

local  deviations.  The  parameters  of  the  nominal  model  are  estimated  at  runtime 
and  the  model  is  updated.  When  the  dynamics  changes  significantly,  the  reference 
trajectory  is  resynthesized.  Figure  2  shows  the  interplay  between  the  two  modules 
in  the  context  of  controlling  a  real  physical  plant. 

2.3.1  Reference  trajectory  generation 

The  planning  module  consists  of  a  global  path  planner  and  a  local  hajectory  gener¬ 
ator.  The  global  path  planner  finds  coarse  global  paths  from  the  initial  state  to  the 
goal  state,  utilizing  the  flow  pipes  of  trajectories.  The  local  trajectory  generator 
fits  smooth  trajectory  segments  into  the  global  path  by  slicing  out  trajectory  seg¬ 
ments  from  flow  pipes  or  through  local  trajectory  deformation.  Figure  3  illustrates 
the  data  flows  among  components  of  the  planning  module. 

(1)  Global  Path  Planner 

The  global  path  planner  searches  for  optimal  paths  from  an  initid  state  to  a 
goal  state  in  the  phase  space,  subject  to  design  constraints  such  as  fast  convergence 
and  little  overshooting.  The  high-level  qualitative  description  of  the  phase  space 
guides  the  search. 

To  synthesize  a  global  path,  MAPS  first  explores  in  a  set  of  phase  portraits 
indexed  by  the  selected  values  of  the  control  signal.  It  generates  a  sequence  of 
phase  space  descriptions  summarized  in  a  geometric  vocabulary.  Flow  pipes  are 
then  extracted  from  the  descriptions.  The  global  planner  searches  for  a  pipe  path 
—  a  sequence  of  flow  pipes  interconnected  at  intermediate  points  —  from  the  initial 
state  to  the  goal  state  in  this  collection  of  flow  pipes. 
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Figure  3:  Reference  trajectory  generation 

For  stabilization  problems,  we  need  to  synthesize  an  attractor  at  the  goal  state. 
When  the  system  enters  the  neighborhood  of  the  goal  state,  a  linear  feedback 
controller  is  switched  in  to  stabilize  the  system  at  the  goal  state. 

(2)  Local  Trajectory  Generator 

The  local  trajectory  generator  produces  smoothed  trajectories  connecting  in¬ 
termediate  points.  More  specifically,  trajectory  segments  are  extracted  from  the 
flow  pipes  forming  the  global  path.  A  trajectory  segment  is  extracted  from  a  flow 
pipe  and  is  continuously  deformed  to  connect  intermediate  points  in  the  path.  The 
local  trajectory  generator  also  smoothes  out  sharp  interconnections  of  trajectory 
segments  to  reduce  undesired  transients,  with  local  linear  controllers  or  local  sliding 
surface  insertion  [14].  The  noises  and  parametric  uncertainties  of  the  synthesized 
reference  trajectory  are  modeled  with  a  thickened  trajectory.  To  guarantee  ro¬ 
bustness  the  thickened  trajectory  segment  is  restricted  within  the  flow  pipe  that 
contains  the  original  reference  trajectory  segment. 

The  planning  module  generates  a  plan,  i.e.,  the  reference  trajectory  along  with 
the  control  action  for  each  segment  of  the  reference  trajectory. 

2.3.2  Reference  trajectory  tracking 

The  tracking  module  follows  the  planned  trajectory  and  reactively  corrects  the  de¬ 
viation  due  to  uncertainties  in  dynamics  modeling,  disturbances,  etc.  The  tracking 
is  usually  local  to  the  region  of  the  phase  space  containing  the  trajectory  segment; 
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Figure  4:  Reference  trajectory  tracking 

see  Figure  4.  The  planned  reference  trajectory  provides  the  global  feedforward  ref¬ 
erence  term.  The  sensor  and  estimator  measure  the  actual  state,  whose  difference 
from  the  reference  term  is  the  local  feedback  correction  term. 

(1 )  Trajectory  tracking  and  reactive  correction 

The  Phase  Space  Navigator  tracks  a  synthesized  reference  trajectory  by  sensing 
the  state  of  the  system,  comparing  the  state  with  the  reference  trajectory,  and 
computing  the  current  cont-ol  action.  If  the  system  stays  within  the  tolerance 
of  the  reference  trajectory,  the  controller  simply  looks  up  for  a  control  action  in 
the  planned  trajectory.  If  the  system  deviates  from  the  desired  trajectory,  a  local 
linear  controller  is  switched  in  to  force  the  system  back  on  track.  The  trajectory 
correction  is  local  and  reactive.  It  is  designed  to  ensure  the  robustness  of  the  system 
in  the  presence  of  small  changes  in  system  parameters  and  noises  in  measurements. 

(2)  Model  updating  and  reference  trajectory  resynthesis 

The  tracking  module  estimates  the  model  parameters  through  the  observables 
and  updates  the  nominal  model.  When  the  dynamics  of  the  system  changes  sub¬ 
stantially  such  that  the  planned  global  path  is  no  longer  feasible,  the  Phase  Space 
Navigator  calls  the  planning  module  to  synthesize  a  new  reference  trajectory. 
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3  The  Automated  Control  Synthesis  Algorithm 

We  consider  a  control  system  x'  =  f(x,u),  where  x  €  Rn  is  the  state  variable 
and  u  €  Rm  is  the  control  parameter.  The  initial  state  of  the  system  is  x3  and 
the  goal  state  for  the  system  to  reach  is  xg.  The  set  of  admissible  values  for  the 
control  parameter  u  is  {u,-}.  The  planning  algorithm  searches  for  paths  from  x3  to 
xg,  subject  to  certain  design  constraints,  in  the  phase  spaces  indexed  by  control 
parameter  values  {**,-}.  The  tracking  algorithm  follows  the  synthesized  reference 
path.  Figure  5  shows  the  flow  chart  of  the  planning  algorithm. 

3.1  The  Trajectory  Planning  Algorithm 

1.  generating  qualitative  phase  space  descriptions  and  flow  pipes: 

Qualitatively  model  the  phase  space  structure  and  flow  pipes  for  each  of  the 
phase  portraits  indexed  by  {u,-}. 

2.  constructing  flow  pipe  graph: 

Paste  together  flow  pipes  from  the  phase  portraits  and  form  a  directed  graph, 
the  flow  pipe  graph,  where  nodes  are  the  intersections  of  flow  pipes  and  edges 
are  the  segments  of  flow  pipes.  In  particular,  x3  and  x3  are  nodes  in  the  graph. 

3.  weighing  flow  pipe  graph: 

Compute  weight  for  each  edge,  a  segment  of  a  flow  pipe,  according  to  trav¬ 
eling  time,  smoothness,  etc.  (The  weight  can  be  a  single  value  or  a  range  of 
values.) 

4.  clustering  path  components: 

Cluster  the  flow  pipe  graph  into  flow  pipe  path  components.  If  x3  and  xg  are 
in  the  same  flow  pipe  path  component,  go  to  step  7.  Otherwise,  the  goal  is 
unreachable  with  the  given  set  of  parameter  values  {«,•};  continue. 

5.  searching  for  partial  paths  from  x3  to  xg : 

Compute  the  reachable  set  Rx,  from  x3  and  the  reverse  reachable  set  RXg 
from  xg  in  the  flow  pipe  graph.  Denote  the  gap  between  R-J  and  RXg  to  be 
G  =  \\Rr,-Rz,\Y 

6.  tuning  control  parameter: 

Search  for  a  value  of  the  control  parameter  u  outside  {u,}  such  that  the 
corresponding  phase  portrait  has  flow  pipes,  when  added  to  the  flow  pipe 
graph,  reducing  the  value  of  G.  If  G  is  zero,  i.e.,  the  gap  between  Rr,  and 
Rxg  is  bridged,  collect  the  flow  pipe  paths  from  x3  to  xg  and  go  to  step  S. 
Otherwise,  repeat  step  6  until  no  more  parameter  values  to  search,  in  which 
case  return  the  collection  of  partial  flow  pipe  paths  from  x3  to  xg. 
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7.  finding  paths  from  x3  to  xg : 

If  looking  for  the  shortest  paths  from  x3  to  xg,  run  a  standard  shortest  path 
algorithm  on  the  graph.  If  searching  for  all  paths  p  from  x3  to  xg  subject  to 
the  constraint  C(p)  <  C-  use  the  following  algorithm. 

(^)  P global— paths  ~ 

(b)  Ppartial— paths  ~ 

(c)  for  each  path  p,-  €  Ppartiai-paths ,  ending  at  x,-,  do 

Ppi  =  { pi  *(x,-,x)|x  one  edge  reachable  from  x,*,  x  ^  xg, 

C(p{)  +  W(xitx)<Cy, 

Pp  —  {p:  *  (x,-,x)|x  one  edge  reachable  from  x,-,  x  =  xg, 
c(pi)  +  w(xi,x)<cy 

Pglahal— paths  =  Pglobal— paths  U  Pp/i 
P partial— paths  =  {.Ppartial— paths  Pi)  U  Pp; • 

(d)  if  Ppartial— paths  §OtO  (^c): 
otherwise,  continue. 

(e)  if  Pglobal— paths  =  gOtO  Step  5; 

otherwise,  continue. 

(f)  order  paths  in  Pgiobai-paths  according  to  their  costs  and  return. 

Note:  C{pi)  is  the  cost  of  a  path  p,-.  W{x{,Xj)  is  the  weight  of  the  edge 
(x;,Xj).  is  the  path  composition  operator. 

8.  generating  piecewise  trajectories: 

Select  a  representative  trajectory  segment  from  each  flow  pipe  forming  the 
flow  pipe  path  from  x3  to  xg  and  paste  them  together  at  intersections  of  flow 
pipes. 

9.  smoothing  trajectories: 

Smooth  each  intersection  of  trajectory  segments  through  trajectory  homo- 
topy  deformation.  Order  the  smoothed  trajectories  with  costs  and  return. 


3.2  The  Tr?  jectory  Tracking  Algorithm 

1.  sensing  and  estimating  the  physical  system: 

Sense  the  state  x  of  the  physical  system,  estimate  the  model  parameters,  and 
update  the  nominal  model  of  the  system.  If  x  is  in  the  neighborhood  of  the 
goal  x3,  go  to  step  5.  If  the  nominal  model  changes  substantially,  signal  and 
go  to  step  4.  Otherwise,  continue. 
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Figure  5:  The  flow  chart  of  the  trajectory  planning  algorithm 


2.  computing  control  action: 

Based  on  the  synthesized  refeience  trajectory  and  the  observation  from  step  1, 
compute  control  tracking  term  u£rae*  and  correction  term  uCOTTCCt.  If  ucorrert 
is  unobtainable  with  a  local  linear  correction,  check  alternative  paths  to  the 
goal  from  the  collection  of  suboptimal  paths.  If  the  goal  is  unreachable,  go 
to  step  4.  Otherwise,  the  control  action  is  u  =  utrac*  t  «correefj  continue. 

3.  generating  control  action: 

Tune  the  control  parameter  to  u£raeje  and  generate  u^reci  with  a  local  linear 
feedback  controller.  Drive  actuators  and  go  to  step  1. 

4.  resynthesizing  reference  trajectory: 

Call  the  planning  module  to  resynthesize  a  reference  trajectory  and  go  to 
step  2. 

5.  goal  stabilizer: 

Stabilize  the  system  at  xg  with  a  local  linear  controller,  subject  to  uncertain¬ 
ties  and  noises. 

3.3  Discussion  on  the  Algorithm 

1.  On-line  and  off-line  synthesis: 

The  planning  module  generates  smoothed  reference  trajectories  in  the  phase 
space.  The  constraints  on  response  time  and  the  availability  of  computational 
resources  dictate  whether  the  computation  of  planning  is  done  on-line  or 
off-line.  If  the  planning  is  done  off-line,  the  synthesized  plan  is  compiled 
into  a  table.  At  the  runtime  the  controller  performs  a  table  lookup  for  a 
control  action.  On-line  reference  trajectory  synthesis  and  tracking  require 
substantial  computational  power.  The  Supercomputer  Toolkit  (4)  provides  an 
ideal  platform  for  experimenting  with  the  above  control  synthesis  algorithm 
on  real-time  control  applications. 

In  step  1  of  the  planning  algorithm,  the  phase  space  structure  and  flow 
descriptions  can  be  either  computed  once  for  all  the  control  parameter  values 
or  generated  on  demand  in  the  search  for  optimal  paths.  The  later  case  will 
be  more  suitable  for  on-line  synthesis. 

2.  Search  for  optimal  paths: 

We  present  the  planning  algorithm  for  a  control  design  problem  with  one 
initial  state  and  one  goal  state.  The  algorithm  also  applies  to  control  systems 
of  “one  initial  state/manv  goal  states0,  “many  initial  states/one  goal  state-5, 
or  “many  initial  states/many  goal  states".  We  note  that  the  flow  pipe  graph 
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is  a  directed  graph  where  nodes  are  intersections  of  flow  pipes  and  edges 
are  segments  of  flow  pipes.  (The  graph  may  contain  cycles.)  The  search  for 
optimal  paths  is  formulated  as  a  search  for  shortest  paths  in  the  directed 
graph. 

For  a  “one  initial  state/one  goal  state”  or  “one  initial  state/many  goal  states” 
problem,  the  Dijkstra’s  algorithm  for  the  single  source  shortest  paths  in  a 
directed  graph  runs  in  0(E‘2)  for  a  graph  with  V  vertices  and  E  edges  [6]. 

A  “many  initial  states/one  goal  state”  problem  can  be  converted  to  the  “one 
initial  state/many  goal  states”  problem  by  reversing  the  directions  of  all 
edges;  and  a  “many  initial  states/many  goal  states”  problem  is  solved  by  the 
Floyd- Washall  algorithm  in  0(V3)  [6). 

3.  Suboptimal  paths: 

The  suboptimal  paths  can  be  useful  when  the  optimal  paths  are  no  longer 
judged  feasible.  They  can  be  stored  in  the  table  in  addition  to  the  optimal 
ones.  A  controller  can  opportunistically  choose  among  available  trajectory 
paths  according  to  the  desired  properties  at  a  control  switching  point. 

4.  discrete  vs.  continuous  control  parameter  spaces: 

A  version  of  the  synthesis  algorithm  is  presented  for  the  control  parameter 
initially  taking  values  from  a  finite  discrete  set.  An  example  of  finite-value 
control  systems  is  the  satellite  attitude  controller,  which  stabilizes  the  an¬ 
tenna  direction  of  the  satellite  subject  to  disturbances  by  turning  on  or  off 
high  thrust  jets.  Other  such  examples  are  switching  power  regulators. 

The  method  also  applies  to  control  systems  with  continuous  control  param¬ 
eter  spaces.  The  continuous-  parameter  space  is  first  sampled  at  discrete 
points.  Then  techniques  such  as  those  of  Abelson’s  bifurcation  interpreter  [3] 
can  be  employed  to  search  for  distinct  behaviors  at  finer  scales. 

4  Trajectory  Flow  Pipes 

The  flow  pipes  are  further  geometric  and  dynamical  characterizations  of  stability 
regions  of  the  phase  space.  They  form  the  foundation  for  the  search  algorithm  for 
synthesizing  global  paths. 

A  flow  pipe  models  a  collection  of  trajectories  exhibiting  the  same  qualitative 
features  in  the  phase  space.  It  is  an  equivalence  class  of  trajectory  paths,  each  of 
which  can  be  continuously  deformed  to  another  one,  delimited  by  the  boundaries 
of  stability  regions  and  trajectories  connecting  equilibria2. 

2  Formally,  a  flow  pipe  is  a  homotopy  equivalence  class  of  trajectory  paths  delimited  by  the 
boundaries  of  stability  regions  and  trajectories  connecting  equilibria.  If  the  ends  of  a  flow  pipe 
are  squeezed  to  points,  the  flow  pipe  is  a  path  homotopy  equivalence  class  of  trajectories. 
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Figure  6:  Classifying  faces  of  simplices  with  respect  to  flows 

Computationally,  a  flow  pipe  is  constructed  by  aggregating  consistent  geometric 
pieces  with  respect  to  the  way  that  the  flow  travels  from  piece  to  piece.  The 
catalogue  of  flow  types  on  boundaries  of  the  geometric  pieces  is  a  set  of  consistency 
constraints  derived  from  the  underlying  dynamics  of  the  vector  field. 

4.1  Constructing  flow  pipes 

MAPS  tessellates  a  phase  space  with  simplices  and  geometrically  models  the  sta¬ 
bility  regions  with  polyhedra.  Each  proper  face  /  of  a  simplex  s  is  classified  into 
one  of  the  three  types  with  respect  to  the  direction  of  the  flow  (see  Figure  6), 
according  to  the  following  labeling  scheme: 

1.  unidirectional  flow: 

label  in:  Trajectories  flow  into  $  on  /; 
label  out:  Trajectories  flow  out  of  s  on  /. 

2.  bidirectional  flow: 

label  in-and-out:  Trajectories  flow  into  s  on  some  portion  of  /  and  flow  out 
of  s  on  other  portion. 

3.  zero  flow: 

label  tangent:  The  flow  does  not  intersect  /.  Hence  the  proper  face  /  is  on 
the  boundary  of  the  flow. 

A  proper  face  of  a  simplex  is  called  monotonic  with  respect  to  both  the  flow 
and  the  simplex,  if  the  face  is  labeled  either  type  1  or  type  3  in  the  above  labeling 
scheme.  A  simplex  is  monotonic  with  respect  to  the  flow  if  all  of  its  proper  faces 
are  monotonic.  Similarly,  a  polyhedron  is  monotonic  if  all  of  its  proper  faces  are 
monotonic.  Given  two  simplices  agreeing  on  a  non-monotonic  proper  face,  the 
common  non-monotonic  face  is  canceled  in  the  polyhedron  formed  by  the  two 
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simplices;  see  Figure  7(a).  The  polyhedron  continues  to  grow  in  this  way  until  all 
of  its  proper  faces  are  monotonic.  The  monotonic  polyhedra  are  then  aggregated 
together  to  form  flow  pipes  in  such  a  way  as  to  conserve  the  flow  on  their  proper 
faces;  see  Figure  7(b).  The  following  algorithm  groups  simplices  into  smallest 
monotonic  polyhedra  and  clusters  monotonic  polyhedra  into  flow  pipes,  the  largest 
monotonic  polyhedra. 


Flow  Pipe  Construction  Algorithm 

1.  label  proper  faces  of  each  simplex  with  respect  to  the  flow. 

2.  cluster  simplices  into  monotonic  polyhedra: 

Cluster  simplices  into  equivalence  classes  with  the  equivalence  relation  on  the 
proper  faces  of  the  simplices:  two  simplices  are  equal  if  they  agree  on  a  proper 
face  with  the  same  flow  label:  in-and-out 

3.  cluster  monotonic  polyhedra  into  flow  pipes: 

Cluster  monotonic  polyhedra  into  equivalence  classes  with  the  equivalence  relation 
on  the  proper  faces  of  the  polyhedra:  two  monotonic  polyhedra  are  equal  if  they 
agree  on  a  proper  face  with  consistent  flow  directions:  in  and  out,  respectively. 

4.  order  the  monotonic  polyhedra  in  each  class  from  step  3  according  to  the  flow 
direction  and  form  a  flow  pipe,  return. 

The  resulting  flow  pipes  model  the  trajectory  flows.  The  boundary  of  a  flow 
pipe  is  formed  by  the  proper  faces  labeled  tangent  of  simplices  in  the  pipe. 

A  flow  pipe  contains  time  information  of  the  trajectory  flow.  The  monotonic 
polyhedral  pieces  in  a  flow  pipe  approximate  successive  flow  wave  fronts  indexed 
by  the  time.  The  flow  pipe  also  characterizes  the  Lyapunov  exponent  [8]  of  the 
flow  with  the  expansion  and  contraction  of  a  volume  element,  for  example,  the 
crossover  section  of  the  flow  pipe,  along  the  flow  pipe.  The  progressive  movement 
of  the  flow  wave  front  and  its  change  in  shape  quantitatively  describe  the  flow, 

4.2  Classifying  shapes  of  flow  pipes 

Flow  pipes  capture  dynamical  shapes.  A  flow  pipe  ends  at  an  attractor  and/or 
starts  from  a  repellor.  Saddles  can  not  be  in  a  flow  pipe  except  for  the  boundary 
of  the  pipe,  since  flow  pipes  are  further  decomposition  of  the  stability  regions.  In 
a  bounded  region,  each  flow  pipe  either  ends  at  an  attractor  or  leaves  the  region. 

Flow  pipes  exhibit  topological  shapes.  Each  flow  pipe  is  classified  according  to 
its  relative  shape.  A  flow  pipe  around  a  stable  limit  cycle  is  a  closed  pipe.  Most 
pipes  are  open.  The  open  pipes  can  be  relatively  straight  or  highly  wound.  One, 
two,  or  more  pipes  can  side  by  side  wind  into  a  spiral,  called  n-winding  pipes.  See 
Figure  8.  We- classify  the  pipes  into  the  following  categories: 

1.  open  pipes:  (a)  straight  pipes;  (b)  wound  pipes. 


14 


Figure  7:  Construction  of  flow  pipes:  (a)  grouping  simplices  to  form  a  monotonic 
polyhedron  by  canceling  common  non-monotonic  faces;  (b)  aggregating  monotonic 
polyhedra  to  form  a  flow  pipe  according  to  flow  directions  at  the  boundaries. 
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2.  closed  pipes. 

Figure  1(b)  of  Section  2.2  shows  a  single  pipe  winds  into  a  spiral  shape  at  its 
end  and  forms  the  stability  region  for  the  attractor.  More  complicated  patterns 
can  occur  with  more  pipes  or  in  higher  dimensions. 

5  An  Example:  Stabilizing  a  Buckling  Column 
5.1  The  column  model 

A  thin  elastic  column,  subject  to  axial  end  compressive  loads,  buckles  around 
the  principal  axis,  as  shown  in  Figure  9.  The  nonlinearity  is  introduced  by  the 
nonlinear  geometric  stiffness  of  the  column  [11].  Stoker  [15]  gave  a  simplified 
model  for  the  column  subject  to  axial  compressive  force  and  viscous  damping 

mx"  +  ex'  +  a\X  +  azxz  =  0,  (1) 

where  a\  =  A  4-  C  —  2 P/l  and  az  —  B  +  D  —  Pf 13.  The  state  x  is  the  characteristic 
measure  of  the  column  deflection  from  the  principal  axis  and  x'  is  the  velocity. 
The  column  has  mass  m  and  length  21.  The  axial  load  is  P,  and  the  coefficient  of 
viscous  damping  is  c.  The  bending  stiffness  is  modeled  by  a  primary  hard  spring 
with  restoring  force  Ax  +  Bxz  and  a  secondary  hard  spring  with  restoring  force 
Cx  +  Dxz.  We  rewrite  equation  (1)  as  a  system  of  first-order  equations 
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Figure  9:  Buckling  of  a  thin  elastic  steel  column  due  to  axial  end  loads. 


f  ®i  =  *2  ,2) 

\  A  =  a\Xi  -  a3x i  -  cx2),  K 

where  x\  represents  the  deflection  and  x2  represents  the  velocity. 

The  system  (2)  describes  the  buckling  motion  of  the  column  and  represents 
only  a  single  mode  of  vibration.  For  a  long  and  slender  column,  vibrations  are 
observed  to  occur  primarily  in  the  first  mode  [12], 

When  the  axial  load  P  is  less  than  the  critical  load  Pcriticai,  the  column  oscillates 
around  the  principal  axis  and  returns  to  the  vertical  state.  It  has  only  one  stable 
state  corresponding  to  the  attractor  at  the  origin  of  the  phase  space.  Under  a 
heavier  load,  i.e.,  P  >  Pcriticah  the  column  buckles  to  one  side  or  the  other.  The 
phase  space  has  a  saddle  at  the  origin  and  two  attractors  symmetrically  arranged 
about  the  saddle;  see  Figure  10.  The  positions  of  the  two  buckled  states  depend 
on  the  external  load  P ,  the  stiffness  of  the  column,  and  the  length  l.  The  larger 
the  load  is,  the  farther  away  the  buckled  states  are  from  the  principal  axis.  The 
column  breaks  when  the  buckling  exceeds  certain  limit.  As  P  surpasses  PCTiticaU 
the  column  undergoes  a  pitchfork  bifurcation  of  equilibria  [8]:  the  attractor  at  the 
origin  gives  birth  to  a  saddle  at  the  origin  and  two  attractors  on  two  sides. 

5.2  Extracting  and  representing  qualitative  phase  .space 
structure  of  the  buckling  column 

MAPS  automatically  analyzes  the  column  model  in  the  phase  space  and  extracts 
and  represents  the  qualitative  phase, space  structure  [17].  For  parameter  values 
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Figure  10:  The  phase  space  of  a  buckling  column  showing  the  stability  boundaries 
and  connecting  trajectories.  The  horizontal  axis  x  is  the  characteristic  measure 
of  the  column  displacement  from  its  principal  axis  and  the  vertical  axis  x'  is  the 
velocity. 

ai/m  =  —2.0,  a3/m  =  1.0,  and  c/m  =  0.2  and  phase  space  region  —3.0  <  x-i  <  3.0 
and  -4.0  <  x2  <  4.0,  the  program  reports  the  following  findings  and  represents 
them  in  a  relational  graph: 

<equilibrium-points : 

1 .  attractor  at : ( 1 . 41  0 . ) 

2.  saddle  at:(0.  0.) 

3.  attractor  at: (-1.41  0.)> 

<r  elat  ional-graph. : 

1.  3tability-region  for  attractor  at:*infinity*: 
stability-boundary : 

connecting-traj  ectories : 

2.  3tability-region  for  attractor  at: (1.41  •*.): 
stability-boundary : 

trajectory  l:(from  ^infinity*  to  (0.  0.)) 
trajectory  2:(frcn  "“infinity*  to  (»V.  0.)) 
connecting-traj ectorias : 

trajectory  3: (from  (0.  0.)  io  (1.41  0.)) 

3.  stability-region  for  attractor  at: (-1.41  0.): 
iitability-boundary : 

trajectory  l:(from  *infinity*  to  (0.  0.)) 
trajectory  2:(fro:a  *infinity*  to  (0.  0.)) 
connecting-trajectories: 
trajectory  4:(frora,(0.  0.)  to  (-1.41  0.))> 


Figure  11:  The  flow  pipe  for  the  attractor  on  the  left. 

The  program  finds  two  attractors  at  (1.41,0.0)  and.  (—1.41,0.0)  and  a  saddle 
at  the  origin.  It  generates  a  high-level  description  of  the  phase  space  geometry: 
two  banded  stability  regions  associated  with  the  two  attractors,  separated  by  the 
stable  trajectories  of  the  saddle  at  the  origin.  Figure  10  shows  stability  boundaries 
and  connecting  trajectories  of  two  stability  regions.  Based  on  the  geometric  phase 
space  representation,  tk?  phase  space  is  further  modeled  with  two  flow  pipes  formed 
by  aggregating  geometri'  pieces,  using  the  flow  pipe  construction  algorithm.  The 
pipe  boundaries  consist  of  separatiices  of  the  two  stability  regions  that  approach 
the  saddle  and  trajectories  that  connect  equilibria.  Figure  11  shows  the  flow  pipe 
that  ends  at  the  left-hand  attractor. 

5.3  Synthesizing  a  control  law  for  stabilizing  the  column 
in  the  phase  space 

We  want  to  stabilize  the  column  at  its  vertical  state  to  support  the  axial  end 
loads  and  to  prevent  it  from  breaking.  Under  sufficiently  heavy  load,  the  buckling 
motion  of  the  column  leads  the  column  to  one  of  the  buckled  states  and,  when  the 
states  are  far  away  from  the  principal  axis,  induces  the  failure  of  the  column. 

We  shall  focus  on  global  navigation  in  the  phase  space  that  synthesizes  global 
reference  trajectories  leading  to  the  desired  goal  state.  Since  the  design  of  local 
linear  controllers  is  relatively  well  understood,  we  shall  not  disrues  the  trade-offs 
among  different  designs  for  linear  controllers  and  shall  choose,  for  the  purpose  of 
demonstration,  a  simple  linear  feedback  design.  Local  controllable  regions  of  such 
linear  controllers  are  quantified,  given  available  control  strength,  and  are  used  in 
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constraining  the  design  of  global  control  paths. 

The  goal  sfate  of  the  control  is  the  unbuckled  state  —  the  saddle  at  the  origin 
of  the  phase  space  that  does  not  have  a  stability  region.  We  want  to  synthesize 
a  non- zero  stability  region  for  the  goal  and  maximize  the  legion.  The  controlled 
column  is  of  the  form 


[  A  =  z2 

1  X2  =  -  CX2)  +U,  V 

where  u  is  the  control.  In  the  m.  del,  mu  has  the  same  dimension  as  the  force. 

5.3.1  Synthesizing  a  local  goal  stabilizer 

When  the  system  gets  close  to  the  goal  state,  the  Phase  Spa 's  Navigator  synthesizes 
a  local  linear  controller,  say  a  PID  controller  [7],  to  stabilize  the  system  at  the  goal. 
The  available  control  strength  determines  a  neighborhood  UctT  of  the  goal  within 
which  the  local  linear  controller  suffices  to  stabilize  the  system  at  the  goal. 

Many  mechanical  systems  employ  force  control.  The  velocity  v  is  directly 
controllable,  whereas  the  position  x  is  indirectly  controllable  via  x'  =  v.  For 
example,  the  column  could  be  stabilized  against  buckling  with  a  force  exerted 
perpendiculai  to  the  principal  axis  near  the  center  of  the  column,  as  currently 
being  investigated  by  Andrew  Berlin.  The  controllable  subspace  of  the  phase 
space  of  the  column  is  in  the  direction  of  x2.  Since  the  unstable  subspace  of  the 
saddle  intersects  transversally  the  controllable  subspace  in  a  linear  neighborhood 
of  the  saddle,  the  goal  is  k  -ally  stabilizable  [10]. 

The  local  linear  controller  at  the  goal  is  specified  as 

Ulocal— linear  =  ~  kXX\  &2£2,  (4) 

where  kx,  k2  >  0  are  feedback  strengths.  The  values  for  kx,  k2  are  chosen  according 
to  the  available  control  authority  and  the  condition  that  the  line  U{ocai-iinear  —  0  is 
approximately  in  the  unstable  direction  of  the  saddle  in  its  neighborhood.  For  our 
purpose,  v/e  simply  let  kx  =  4.  IS,  k2  =  2.75.  The  control  strength  \uiocai-iimaA  is 
constrained  within  e,  i.e.,  |u(oca/_j,-near|  <  e.  This  constrains  the  controllable  region 
Uctr  of  the  local  linear  controller 

Uctr  Q  {(mi, m2)|e  <  kxxx  +  k2x2  <  e}. 

The  local  linear  region  around  the  saddle  is  also  determined,  within  which  local 
stable  and  unstable  eigenvectors  are  good  approximation  for  stable  and  unstable 
trajectories  of  the  saddle.  The  curvature  of  the  stable  and  unstable  trajectories 
characterizes  the  size  of  the  local  linear  region.  When  Uctr  is  v/ithin  the  linear 
region,  the  above  synthesized  linear  feedback  controller  suffices  to  stabilize  the 
system  at  the  goal. 
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5.3.2  Bending  flow  pipes  to  deform  trajectories 

The  Phase  Space  Navigator  automatically  synthesizes  a  smooth  global  trajectory 
from  an  initial  state  x,  to  the  goal  state  xg.  We  consider  two  cases  for  the  initial 
states  xs. 

(a)  The  column  is  initially  buckling  with  sufficient  velocity 

The  initial  state  xs  is  far  away  from  the  saddle  xg  in  the  phase  space  in  this 
case.  The  search  for  global  control  paths  to  the  goal  is  simple.  The  single  flow 
pipe  containing  x,  intersects  the  local  controllable  region  Uctr  of  xg.  To  synthesize 
a  trajectory  that  enters  Uctr ,  the  region  Uctr  is  backward  projected  in  flow  pipes 
to  form  a  goal  projection.  Then  the  trajectory  emanating  from  x3  is  deformed 
towards  the  goal  projection.  The  progrunmed  deformation  is  designed  to  push 
the  trajectory  towards  the  nearest  goal  projection  in  the  controllable  direction 
x2.  Since  the  control  is  more  effective  when  the  direction  of  the  vector  field  is 
relatively  orthogonal  to  the  'urection  of  the  controllable  direction,  switching  points 
are  inserted  to  turn  off  the  contrr  der  when  the  angle  between  the  two  directions 
are  below  some  threshold.  Figure  12(a)  shows,  for  example,  a  synthesized  reference 
trajectory  originating  at  (—1,  -3).  The  circles  indicate  places  where  the  control  of 
deformation  switches.  The  flow  pipe  that  leads  to  the  attractor  on  the  left  contains 
the  initial  state.  The  goal  is  on  the  boundary  of  the  pipe.  Given  the  maximum 
control  strength  for  the  linear  goal  stabilizer  (4)  ir.  Section  5.3.1,  the  controllable 
region  Uctr  is  bounded  by  the  region  specified  as 

|4.18xi  +  2.75s2|  <  0.2. 

In  this  demonstration,  we  choose  an  over-conservative  region  for  Uctr  determined 
by 


Uctr  =  {(xi,x2)  I  10.8351!  +  0.550x2|  <  0.04,  |0.797x!  -  0.605x2|  <  0.04},  (5) 

in  which  the  local  linear  approximation  of  the  vector  field  is  also  valid.  The  Uctr, 
formed  by  flow  pipes  too,  intersects  the  flow  pipe  that  ends  at  the  left  attractor,  as 
illustrated  in  Figure  13.  It  is  backward  projected  to  form  two  thin  pipes  enclosing 
the  boundaries  of  the  flow  pipe.  The  global  trajectory  emanating  from  (—1,  —3)  is 
pushed  towards  the  goal  projection  with  small  deformation  that  is  less  than  10% 
of  the  vector  field  strength  at  any  state  or  0.2  in  the  normalized  unit,  whichever  is 
smaller.  The  position  x  and  velocity  v  =  x‘  t.-C  tUe  controlled  column  are  plotted 
against  the  time  t  in  Figure  14(a)  and  (b),  respectively.  The  control  u  is  shown  in 
Figure  15. 

When  the  goal  is  not  reachable  with  the  current  deformation,  the  synthesis 
algorithm  tries  again  with  increased  control  strength  if  possible.  The  synthesized 
trajectories  could  bo  further  optimized  with  variational  techniques  on  the  collection 
of  trajectories  vv'shin  the  flow  pipe  segments  [5]. 
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(a)  (b) 

Figure  12:  The  synthesized  control  reference  trajectories  that  lead  to  the  unbuckled 
state  corresponding  to  the  saddle  at  the  origin  in  the  phase  space  of  the  column 
model:  (a)  starting  with  the  column  buckling  with  sufficient  velocity;  (b)  starting 
with  the  column  in  the  buckled  state  on  the  left. 


Figure  13:  The  portions  of  phase  spaces  for  the  uncontrolled  column  and  the  locally 
controlled  column. 
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Figure  14:  Performance  of  the  synthesized  control  reference  trajectory  for  the 
column  initially  buckling  with  sufficient  velocity:  (a)  position  x  of  the  column 
plotted  against  time  t\  (b)  velocity  y(=  x')  of  the  column  plotted  against  time  t. 


Figure  15:  Control  signal  u  for  stabilizing  the  column  initially  buckling  with  suffi¬ 
cient  velocity,  plotted  against  time  t. 
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(a)  (b) 

Figure  16:  Performance  of  the  synthesized  control  reference  trajectory  for  the 
column  starting  at  the  buckled  state:  (a)  position  x  of  the  column  plotted  against 
time  f;  (b)  velocity  v(=  x')  of  the  column  plotted  against  time  t. 

(b)  The  column  is  initially  buckled 

The  control  objective  here  is  to  pull  the  column  out  of  the  buckled  state  and 
bring  it  close  to  the  unbuckled  state.  Since  the  flow  pipe  containing  the  initial 
state  does  not  intersect  UctT,  a  different  control  strategy  must  be  employed.  The 
program  uses  the  following  strategy  to  synthesize  a  control  path. 

The  synthesized  reference  trajectory  is  shown  in  Figure  12(b).  Again,  the 
control  parameter  changes  at  places  marked  by  circles.  The  column  is  initially 
near  the  buckled  state  at  (—1.41,0.0).  Its  uncontrolled  trajectory  would  spirally 
approach  the  attractor  in  the  clockwise  fashion.  The  control  is  exerted  in  such  a 
way  as  to  swing  the  trajectory  away  from  the  buckled  state  to  approach  the  goal 
projection  of  the  saddle.  When  the  trajectory  intersects  the  goal  projection,  the 
control  is  switched  off  so  that  the  system  slides  along  the  uncontrolled  trajectory. 
As  soon  as  the  system  enters  Uctr  determined  by  (5),  the  linear  controller  is  switched 
in.  The  control  strength  is  less  than  half  of  the  vector  field  strength  or  1.0  in  the 
normalized  unit,  whichever  is  smaller.  The  position  x  and  velocity  v  =  x'  of  the 
controlled  column  are  plotted  against  the  time  t  in  Figure  16(a)  and  (b).  The 
control  u  is  shown  in  Figure  17. 

Tracking  the  synthesized  reference  trajectory  is  executed  in  the  same  way  that 
the  deformation  is  done. 
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5.4  The  phase  space  modeling  makes  the  global  naviga¬ 
tion  possible 

The  qualitative  description  of  the  phase  space  structure  and  the  geometric  mod¬ 
eling  of  the  trajectory  flows  provide  a  “map”  for  navigating  system  trajectory  to 
the  goal  in  the  phase  space. 

The  directed  graph  constructed  from  the  flow  pipes  is  used  in  searching  for 
global  paths  and  in  determining  whether  the  goal  is  reachable.  The  flow  pipes 
also  make  it  possible  to  characterize  the  more  microscopic  spatial  and  temporal 
relations  between  the  current  state  and  the  goal  state.  For  example,  the  collection 
of  simplices  near  the  goal  forms  a  neighborhood  of  the  goal  that  determines  whether 
or  not  the  trajectory  has  missed  the  goal  in  the  flow  pipe.  The  deformation  of 
global  path  segments  is  constrained  by  reasoning  about  the  spatial  relation  with 
flow  pipes. 


5.5  Other  control  problems 

The  stabilization  for  a  buckling  column  is  closely  related  to  the  pole-balancing 
problem,  where  the  control  objective  is  to  balance  a  pole  at  the  vertically  upward 
state.  In  the  phase  space  of  the  pole  model,  the  saddle  is  the  upward  state  of 
the  pole  and  the  two  attractors  on  two  sides  of  the  saddle  correspond  to  the  same 
downward  state  of  the  pole. 

The  control  objective  for  pole-balancing  is  to  bring  the  pole  to  the  upward  state 
from  any  other  positions,  even  when  the  pole  is  initially  hanging  downward.  The 
problem  has  been  studied  as  a  standard  testbed  for  many  control  designs,  most  of 
which  focus  primarily  on  linear  feedback  control  in  linear  regimes.  With  our  phase 
space  navigation  strategy,  a  globally  stable  reference  trajectory  can  be  synthesized, 
similar  to  the  one  for  the  buckling  column  discussed  earlier. 

The  optimal  design  of  such  trajectories  has  many  practical  implications.  For 
example,  the  efficiency  of  cargo  handling  work  at  shipyards  depends  largely  on 
the  operation  of  overhead  cranes  [13].  Figure  IS  illustrates  such  a  crane  driven 
by  a  trolley  drive  motor  and  a  hoist  motor.  The  bottleneck  of  the  ship  unloading 
operation  is  the  transfer  of  cargo  from  a  ship  to  waiting  trucks;  therefore,  mini¬ 
mizing  this  transfer  time  brings  about  a  large  cost  saving.  The  planar  motion  of 
the  load  hanging  at  the  moving  trolley  can  be  modeled  as  a  swinging  pendulum 
with  a  moving  fixed  point  and  a  hoisting  cable.  The  trajectories  of  such  overhead 
cranes  could  be  optimally  designed  with  the  Phase  Space  Navigator.  This  would 
be  an  important  step  towards  the  full  automation  of  cargo  handling  work  without 
a  crane  operator. 
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Figure  17:  Control  signal  u  for  stabilizing  the  column  starting  at  the  buckled  state, 
plotted  against  time  t. 


Figure  18:  The  mechanical  model  for  an  overhead  crane  unloading  cargo  from 
ships,  reproduced  from  (SakawafoShindo,  1982].  The  crane  is  equipped  with  a 
trolley  drive  motor  and  a  hoist  motor.  The  planar  motion  of  the  load  is  modeled 
as  a  swinging  pendulum  hanging  at  the  moving  trolley. 


6  Related  Wor.1'  n  Nonlinear  Control 

The  gain  scheduling  and  cellular  control  approaches  in  nonlinear  control  engineer¬ 
ing  are  related  to  our  phase  space  navigation  method. 

In  gain  scheduling  [14],  the  system  is  linearized  around  a  priori  selected  operat¬ 
ing  points.  A  linear  controller  is  designed  for  each  operating  point.  Interpolation 
is  used  in  regions  between  the  operating  points.  Gain  scheduling  is  an  open-loop 
adaptation  where  parameters  of  a  regulator  change  in  a  prespecified  way.  Autopi¬ 
lots  and  chemical  process  control  are  instances  of  gain  scheduling.  The  method 
requires  human  engineers  to  decompose  a  phase  space  into  locally  linear  regions. 
Our  method  differs  from  the  gain  scheduling  in  that  the  phase  space  is  decom¬ 
posed  automatically,  even  into  global  nonlinear  ones,  and  the  reference  trajectory 
is  synthesized  automatically  to  navigate  the  system  in  the  phase  space  based  on 
the  knowledge  of  the  phase  space  structure. 

The  cellular  control  method  is  based  on  the  cell  state  space  concept  [9].  It 
has  its  root  in  dynamic  programming.  The  state  space  is  first  discretized  into 
a  cell  space.  The  control  law  is  synthesized  with  respect  to  some  performance 
objective,  such  as  minimal  time.  Then  the  control  values  at  discrete  states  are 
programmed  into  a  table.  Run-time  control  is  to  perform  a  table-lookup.  This 
method  requires  an  exhaustive  search  for  control  paths  at  very  fine  grain.  The 
cost  in  high  dimensional  state  spaces  would  be  prohibitive.  The  Phase  Space 
Navigator,  however,  decomposes  the  phase  space  into  a  manageable  collection  of 
flow  pipes  and  searches  for  global  paths  in  this  collection. 

7  Conclusions 

We  have  developed  and  demonstrated  an  automated  control  synthesis  method,  the 
Phase  Space  Navigator,  for  synthesizing  controllers  for  nonlinear  systems  in  the 
phase  spaces.  We  have  discussed  primarily  finite-value  control  systems  as  examples 
to  illustrate  the  method  and  have  noted  that  the  method  also  applies  to  control 
systems  with  continuous  parameter  spaces. 

The  Phase  Space  Navigator  synthesizes  global  reference  trajectories  using  knowl¬ 
edge  of  phase  space  structures  provided  by  the  modeling  and  analysis  program.  A 
phase  space  is  modeled  with  flow  pipes  that  are  collections  of  trajectories  having 
the  same  qualitative  behaviors.  The  flow  pipes  provide  a  way  to  efficiently  search 
for  and  reason  about  global  structures  of  phase  spaces.  The  global  nonlinear  con¬ 
trol  synthesis  becomes  a  graph  search  problem  with  this  representation  of  the  phase 
space.  We  have  shown  how  global  reference  trajectories  can  be  automatically  de¬ 
signed  with  the  example  of  stabilizing  a  buckling  column. 

With  the  novel  idea  of  grouping  an  infinite  number  of  trajectories  into  a  man¬ 
ageable  collection  of  flow  pipes,  the  difficult  task  of  synthesizing  a  nonlinear  con¬ 
troller  is  formulated  as  a  computational  problem  that  requires  a  combination  of 
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substantial  numerical,  symbolic,  and  combinatorial  computations  and  spatial  rea¬ 
soning  techniques.  Such  a  formulation  provides  a  concrete  substrate  and  a  rich 
domain  for  exploring  and  developing  new  reasoning  and  representation  techniques. 
These  techniques  would  be  otherwise  hard  to  grasp  in  traditional  artificial  intelli¬ 
gence  research. 

The  leverage  of  our  automatic  synthesis  method  comes  from  applications  whose 
operating  regions  are  grossly  nonlinear  and  on  which  high-performance  global  con¬ 
trollers  are  impossible  to  synthesize  with  traditional  techniques.  Potential  en¬ 
gineering  applications  include  large  flexible  space  structures,  robot  manipulator 
planning,  satellite  attitude  control,  switching  power  regulators,  etc.  The  research 
in  automated  nonlinear  control  synthesis  that  actively  exploits  the  knowledge  of 
nonlinear  dynamics  in  phase  spaces  has  just  begun.  We  have  primarily  focused 
on  the  automatic  synthesis  for  global  nonlinear  controllers  and  have  not  addressed 
the  important  issues  of  modeling,  sensing,  estimation,  and  high-quality  linear  con¬ 
troller  design.  The  exploration  and  development  of  a  collection  of  new  methods 
tackling  these  issues  in  all  dimensions  can  revolutionize  the  synthesis  and  analysis 
of  high-performance  nonlinear  control  systems. 
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